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Abstract 

We suggest that ensembles of self-replicating entities such as biological 



Cn| systems naturally evolve into a self-organized critical state in which fluc- 

^-H tuations, as well as waiting times between phase transitions ("epochs"), 

o 

^^ are distributed according to a 1// power law. We demonstrate these 

.^ concepts by analyzing a population of coexisting self-replicating strings 

*wV. (segments of computer code) subject to mutation and survival of the 

U 

O fittest. 
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Self-organized criticality [1] is the term generically applied to systems that drive 
themselves to a critical state that is robust to perturbations and whose macroscopic 
behavior is predictable to the extent that it follows power laws with exponents that 
depend only on geometry and the spatial structure. In general, the microscopic pro- 
cesses giving rise to the self-organized critical state are dissipative transport processes 
associated with a threshold, or critical, variable. The paradigm for the self-organized 
critical state is the sandpile: the critical state is the self-similar and robust pile itself, 
distribution of sizes and duration of avalanches (resulting from perturbations) follow 
distinct power laws, and grains of sand are transported if the local slope of the pile 
exceeds a critical value, thus restoring the critical state. 

It has been suggested [2-6] that biological populations are typically in a self- 
organized critical state, evidenced for example by a power-law distribution of extinc- 
tion events. Furthermore, it was observed that population structures gleaned from 
taxonomic data [7] show a fractal geometry. While this is a very appealing idea, espe- 
cially in view of the robustness of living systems, it has suffered from being somewhat 
vague, mainly because of the difficulty involved in modeling living systems. Specifi- 
cally, there is as yet neither a clear identification of the self-organized critical state of 
life or the agent that causes self-organization, nor a definition of a critical or threshold 
variable whose disturbance causes the ubiquitous avalanches giving rise to power-law 
distributions. 

In this Letter, we report the observation of self-organized criticality in an arti- 
ficial living system, the tierra environment [8,9]. In this system, strings of machine 
language-like instructions with the ability to self-replicate in core memory "live" and 
co-evolve in an environment subject to random mutation and selection of the fittest. 
As such, it is not a simulation of life but rather artificial life. Interestingly, the 
system displays some of the uncanny hallmarks known from simple proto-cellular 
systems. Most importantly, this artificial environment offers the chance to control 
the microscopic processes leading to complex behavior, such as replication and mu- 
tation. Furthermore, the macroscopic behavior is predictable via the usual methods 
of statistical mechanics applied to an ensemble of self-replicating entities. 

Simple equations [9] reveal that in this system the fitness of a string is deter- 
mined by its replication rate, as measured by executing the string's instructions (its 



"genome") and counting the number of offspring per unit time. The principle of 
"survival-of-the-fittest" boils down to a "survival-of-the-most-populous." Fitness is 
then a quantity that is genotype specific^ i.e., each individual arrangement of instruc- 
tions in a string translates into a specific replication rate. This is a unique feature that 
makes this system fundamentally different from the molecular quasi-species model of 
Eigen et al. [fO]. We can thus think of fitness as a highly complicated function on 
the space of all strings, while the population is characterized by the current average 
value. This state, however, is metastable: a successful mutation can create a new 
"best" genotype (or "master sequence" [fO]) with a higher replication rate that dis- 
rupts the equilibrium and induces a phase transition to a new "vacuum," defined by 
the new dominant master sequence and its offspring. 

Let Ci stand for the replication rate of genotype z, and (e) for its average over 
the population. In a mean-field approximation, for strings of length £ subject to 
a mutation rate R (we are considering here only external "cosmic-ray" mutations, 
which have an effect similar to copy-errors), the critical variable is the growth factor 

7, = e, - (e) - m . (I) 

In the equilibrium situation, the master sequence and its e-degenerate offspring and 
mutants have 7i = 0, while inferior species have 7i < 0. This prevents exponential 
growth of the most successful species in the long run. An advantageous mutation, 
however, can make 7i > for the new master sequence. Such a disturbance causes the 
information contained in the master sequence to be transmitted throughout the sys- 
tem via the offspring, giving rise to avalanches that are scale-independent. Gradually, 
all genotypes with a subcritical replication rate will become extinct and be replaced: 
the system returns to its critical state. Clearly, the normal state of such a population 
of self-replicating entities is a superposition of a very large number of metastable 
states, with transitions between them induced by mutation and copy-errors. It un- 
dergoes spontaneous phase transitions if a mutation creates a genotype with 7i > 0, 
ushering in a new "epoch" of domination by a new species. In fact, in such a system 
there is no scale that would set the average time between avalanches, nor is there a 
scale setting the size of the avalanche. The latter is determined by the amount of 
information gained by the new master sequence. We thus expect both distributions 
to be given by power laws. 



To test this hypothesis, we have analyzed the evolution of an ensemble of strings 
subject to a Poisson-random mutation rate of i? = 0.5 X 10~® mutations per site 
per unit time (the unit of time is the execution of one instruction). The strings 
are segments of computer code of a specially developed instruction set with only 32 
instructions running on a virtual computer. A mutated instruction most likely will 
cause the program to "break," yet occasionally may improve it. The strings live 
in a strip of memory with a total number of 131,072 sites that can represent one 
instruction each, with periodic boundary condition (i.e., the strip wraps on itself). It 
is typically inhabited by 600-1400 strings of length 60-150, all offspring of a single 
(handwritten) progenitor that is able to self-replicate and used to "inoculate" the 
strip. For purposes of reproducibility, we used the original "ancestor" written by Tom 
Ray [8], who created the tierra system just described. As described elsewhere [9], this 
ancestor is well-suited for evolutionary experiments due to the amount of redundancy 
in its code. The fitness landscape that this population explores is determined by all 
possible ways to reduce the time to gestate a single offspring (the gestation time) and 
the opportunity to trigger bonus CPU time by developing the "genetic code" necessary 
to perform certain user-specified tasks (see Ref. [9] for details on this environment). 
In other words, we provide an environment containing information that the strings 
can discover (through adaptive mutation) and exploit. 

The population adapts to this environment through discontinuous jumps, as evi- 
denced in Fig. 1. 
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FIG. 1 Fitness curve for a typical run. The fitness parameter a of the most 
successful (i.e., most populous) genotype is plotted as a function of time, 
measured every million instructions for a mutation rate R = 0.5 X 10~®. 



There, we have plotted the "fitness-of-the-best" a versus total number of instructions 
executed (i.e., time elapsed) every million instructions for a typical run. For technical 
reasons, the measured quantity a is the replication rate e multiplied by the total 
number of instructions allocated to the strings in one "sweep" through the population, 
with < a < 1. The latter bound is imposed only to maintain parallelism: in order to 
emulate parallel coexistence, each string is allocated a certain slice of CPU time and 
executed serially (see, e.g., Ref. [8] for details). The visible noise in Fig. 1 is mainly 
due to mutations and finite-size effects. This noise, of course, drives the fitness jumps 
that adapt the population to the environment. 




FIG. 2 Power spectrum P(/) of a typical fitness curve a(t) (Fig. I). The 
dashed line is a fit to P(/) ~ /"'^ with /3 = 2.0 ± 0.05. 



We have plotted in Fig. 2 the power spectral density of a typical fitness history, 
which reveals a clear power-law distribution with P{f) ~ /"'^ and /3 = 2.0 it 0.05. 
Scaling exponents from other runs are compatible within the error bars quoted. 

Fluctuations distributed according to a power law are the telltale sign of a self- 
organized critical state [I]. To further our understanding of this state we have also 
measured the distribution of waiting times between phase transitions, or length of 
epochs, in 50 runs under identical conditions (save the random number seed), resulting 
in 512 measured waiting times. 
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FIG. 3 Integrated distribution of times between phase transitions r 
(length of epoch). The sohd hne is a fit to the data with M[t) ~ 
T~" exp (—t/T) with a = 1.1 it 0.05 and a cut-off parameter T = 450 it 50 
modehng finite-size effects. 



Figure 3 shows the waiting time r plotted versus the integrated distribution function 

1 f^ 
M{t) = - / N{t)dt , (2) 

T Jt 

where N[t) is the distribution function. The integrated quantity is distributed with 
the same exponent as iV(t), but is more reliable even with poor statistics on N[t). 
To obtain the waiting times, we determined that a phase transition occurred if the 
fitness jumps discontinually to a new level with a fitness increase of a minimum of 
7.5%. As the resulting plot proves that the fitness curves are fractal, this condition 
cannot change the power law. Rather, fitness curves like the one in Fig. 1 are expected 
to look similar at all scales. Clearly, as we cannot measure waiting times t > 500 [11] 
with good statistical accuracy due to the finite lengths of our runs, the integrated 
distribution function shows finite-size effects that we model with a cut-off parameter 



T. Thus, we fit 



M{t) 



c T exp I 



T' 



(3) 



and find a = 1.1 it 0.05 and T = 450 it 50. The critical exponents reported here do 
not allow for an unambiguous determination of the universality class of the model 
reported here, of the effective dimension of the space occupied by the strings. A 
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manifestly two-dimensional version of the model and a mean-field theory describing 
it are in preparation. 

We have suggested that the normal state of an ensemble of self-replicating enti- 
ties is self-organized criticality, the agent of self-organization being information. We 
identified the growth factor ^i as the critical variable and described avalanches of 
"invention" that drive the adaptation of the population. We tested these hypothe- 
ses in the artificial life system tierra and found power-law distributions in the power 
spectrum of fitness fiuctuations, as well as in the distribution of waiting times. Self- 
organized criticality in living systems has wide-ranging consequences for theories of 
evolution. On the one hand, gradualism is incompatible with criticality, and a punc- 
tuated equilibrium picture is favoured (see, e.g., Ref. [12]). On the other hand, the 
fractal nature of the fitness history (Fig. 1) would account for fitness improvements 
on all scales driven only by microscopic mutations. 

Note added: After completion of this manuscript, we noticed the appearance of 
Ref. [13], wherein conclusions similar to ours are drawn from a simple evolutionary 
model. 
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